COLLAGEN-BINDING PROTEINS FROM 
STREPTOCOCCUS PYOGENES 



Field of the Invention 

The present invention relates in general to proteins from group A Streptococci 
(GAS) that can bind collagen, and in particular to collagen-binding proteins designated 
Cpa1 and Cpa49, and the nucleic acid sequences coding for those proteins, which 
have been isolated from Streptococcus pyogenes and which can be used in methods to 
inhibit collagen binding and thus treat or prevent infectious diseases caused by group 
A Streptococcus bacteria. 

Background of the Invention 

The Streptococci bacteria are a pathogenic genera of microorganisms which 
have been associated with a wide variety of infectious disorders including suppuration, 
abscess formation, a variety of pyogenic infections, and septicemia. In particular, 
Streptococcus pyogenes (a group A streptococci, or GAS) is a prominent pathogen 
which causes skin and mucous membrane infections, as well as deep-seated 
connective tissue infections and severe, sometimes fatal, septicemia. Like many other 
pathogens, in order to infect the human host successfully, GAS must have the ability to 
adjust the expression of its virulence factors according to the varying conditions of 
different anatomical sites. 
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In GAS, the expression of several virul nee factors is positiv ly regulated at th 
I v I of transcription by the Mga regulator. See Perez-Casal t al. (1991); Ch n t al., 
1993; Podbielski et al. (1995) and (1996). Regulated genes include M and M-related 
proteins (phagocytosis resistance, eukaryotic cell interactions), fibronectin-related 
proteins (serum opacity factor), Spep (protease) and c5a peptidase (inactivation of 
complement factor c5a). Recent evidence has demonstrated that, in addition to iron 
levels, pH, CO2, and temperature (see Caparon et al., 1992; Podbielski et al., 1992; 
Okada et al., 1993; Mclver et al., 1995) and activity of the Mga regulator is associated 
with logarithmic and late logarithmic growth phase. See Mclver et al. (1997). 

Another regulator in Streptococcus is RofA, a positive transcriptional regulator of 
the fibronectin-binding protein (prtF) (see Fogg et al., 1994 and 1997) that promote 
bacterial attachment to the host extracellular matrix (see Hanski et al., 1992; and Van 
Heyningen et al., 1993). In contrast to Mga-controlled genes, RofA positively regulates 
prtF transcription as well as its own transcription in response to increased levels of O2 
By a potentially independent mechanism, transcription of prtF is also induced in 
response to intracellular superoxide levels (see Gibson et al., 1996). 

These data have suggested differential expression of eukaryotic cell-binding 
proteins such as RofA-dependent prtF and Mga-dependent emm in response to O2 and 
CO2 partial pressures. These observations have led to the proposal that these 
regulators may influence the expression of proteins important for the attachment of 
GAS in different in vivo environments such as superficial Langerhans cells or 
subsurface keratinocytes (Okada t al. 1994; 1995). As has been observed with regard 
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to other bacterial species, the attachment of bact ria to host cells or implanted 
biomaterials is generally initiated through "extracellular matrix prot ins," or ECM's, 
which generally refer to such general families of macromolecules, collagens, structural 
glycoproteins, proteoglycans and elastins, including fibronectin, and fibrinogen, that 
provide support and modulate cellular behavior. However, the precise role of the 
bacteria's ability to bind to these extracellular matrix proteins and the knowledge of how 
to best utilize this information in order to prevent streptococcal infection has not yet 
been fully determined. 

Moreover, outside of the two regulators RofA and Mga, very little is known with 
regard to environmentally dependent virulence gene expression in GAS, and thus there 
has been very limited information with regard to the regulation and inhibition of the 
extracellular matrix proteins that are responsible for the attachment and infection 
caused by GAS. In light of the extremely severe nature of the bacterial infections 
caused by the Streptococcal bacteria, it is extremely important to make a determination 
of which specific proteins are responsible for attachment to the surface of targeted 
cells, and to be able to use this information in order to develop vaccines and other 
biological agents which can be used to treat or prevention the severe infections 
associated with group A streptococci. 

Summary of the Invention 

Accordingly, it is an object of the present invention to provide isolated proteins 
(adhesins) from group A streptococci which can bind to intercellular matrix proteins 
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such as collagen so as to be useful in dev loping m thods of inhibiting collag n 
binding and attachment of streptococcal bacteria to cells. 

It is a further object of the present invention to provide isolated streptococcal 
surface proteins that are able to inhibit adhesion to the immobilized extracellular matrix 
or host cells present on the surface of implanted biomaterials. 

It is a further object of the present invention to provide a vaccine which can be 
used in treating or preventing infection by group A streptococcal bacterial such as 
Streptococcus pyogenes. 

It is still further an object of the present invention to generate antisera and 
antibodies to the collagen binding proteins from GAS which can also be useful in 
developing methods of treatment which can inhibit binding of the streptococcal bacteria 
to host cells or to implanted biomaterials and thus be employed in order to treat or 
prevent Streptococcal infection. 

It is a further object of the present invention to provide improved materials and 
methods for detecting and differentiating collagen-binding proteins in streptococcal 
organisms in clinical and laboratory settings. 

It is a further object of the invention to provide nucleic acid sequences which 
code for the collagen binding proteins in GAS which can also be useful in producing 
the collagen-binding proteins of the invention and in developing probes and primers 
specific for identifying and characterizing these proteins. 

These and other objects are provided by virtue of the present invention which 
comprises isolated collagen binding prot ins from group A streptococcal bact ria such 
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as Streptococcus pyogenes along with their amino acid and nucl ic acid sequ nces. 
Two of the specific proteins isolated in accordance with the invention are designated 
Cpa1 and Cpa49 which are obtained from the collagen binding region in Streptococcus 
pyogenes, and the sequences for these proteins are those as shown in SEQ ID NOS. 2 
and 4, respectively. The nucleic acid sequences coding for Cpa1 and Cpa49 are 
shown in SEQ ID NOS. 1 and 3, respectively. The isolated proteins of the present 
invention have been observed to bind to collagen, and thus can be utilized in methods 
of treating or preventing streptococcal infection through the inhibition of the ability of 
the bacteria to bind to collagen. 

In another aspect of the present invention, there is also provided antisera and 
antibodies generated against the collagen binding proteins of the present invention 
which also can be utilized in methods of treatment which involve inhibition of the 
attachment of the Cpa proteins to collagen. In particular, specific polyclonal antiserum 
against Cpa has been generated which has been shown to react with Cpa in Western 
immunoblots and ELISA assays and which interferes with Cpa binding to collagen. 
This antiserum can thus be used for specific agglutination assays to detect bacteria 
which express Cpa on their surface. The antiserum apparently does not cross-react 
with bacteria which express the fibfonectin-binding protein F1 on their surface despite 
the fact that a portion of protein F1 exhibits sequence homologies to Cpa1 and Cpa49. 

Accordingly, in accordance with the invention, antisera and antibodies raised 
against the Cpa1 and Cpa49 proteins, or portions thereof, may be employed in 
vaccines, and other pharmaceutical compositions containing the prot ins for 

5 



therapeutic purposes are also provided herein. In addition, diagnostic kits containing 
the appropriate nucl ic acid molecules, the Cpa1 or Cpa49 proteins, or antibodies or 
antisera raised against them are also provided so as to detect bacteria expressing 
these proteins. 

These embodiments and other alternatives and modifications within the spirit 
and scope of the disclosed invention will become readily apparent to those skilled in 
the art from reading the present specification and/or the references cited herein. 

BraEF Descmption of the Drawing figures 

'% Figure 1 is a schematic representation of a comparison of the nra/rofA- 

associated portions of group A streptococcal serotype M1, M6 and M49 strains. 
Results of pairwise comparisons of the deduced amino acid sequences of single ORF's 
are shown as percentage identity values between corresponding sequences. 

; * Sequence alignments were centered at the nra/rofA to prtF/cpa intergenic regions. All 
sequences are shown to scale. For designation of ORF's, see Table 1 hereinbelow. 
The M1 sequence was obtained from the GAS sequencing project (Roe et al., 1997), 
and the M6 sequence was taken from Hanski et al. (1992) and Fogg et al. (1994). The 
inserted box contains the comparison of the deduced Nra and RofA amino acid 
sequences. "." marks identical amino acid positions; "-" marks gaps that were 
introduced into the RofA sequence to maximize alignment. The underlined sequence 
marks the potential helix-turn-helix identified by Fogg et al. (1997). 
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Figure 2 depicts transcript analysis of nra and nra-regulated genes in a CAS 
wild-type (wt) and nra mutant (nra) strain. Total RNA was isolated from lat log phase 
cells grown under anaerobic (aer.) and anaerobic (anaer.) conditions. Unless 
otherwise indicated, 20 ug of total RNA was used per lane for Northern blotting. PCR- 
amplified and digoxigenin-labelled probes specific for nra, cpa, nifR3L and prtF (Table 
4) were used for hybridization. Northern analyses represent the results of transcription 
analysis of (1) the nra gene as shown in Figure 2A, (2) operons adjacent to the nra 
gene as shown in Figure 2B, and (3) the prtF gene, which is located at an unknown 
distance from nra, as shown in Figure 2C. In all cases, an increase in band intensity 
was observed using total RNA isolated from the nra mutant. With the exception of cpa, 
this increase was particularly pronounced in RNA prepared from anaerobically grown 
cultures. The nra message in the wild-type strain was expressed at very low and 
sometimes undetectable levels. 

Figure 3 depicts transcript analysis of the positive global mga regulator gene in 
GAS wild-type (wt) and nra mutant strains, and the transcript analysis of nra, nifR3L 
and cpa in GAS wild-type (wt) and mga mutant strains. Total RNA was prepared from 
mid-Jog phase cells grown under anaerobic conditions and was subjected to Northern 
blot hybridization using the indicated RNA amounts per lane. PCR-amplrfied and 
digoxigenin-labelled probes specific for mga and nra (left) or nifR3L and cpa (right) 
were used for hybridization and subsequent CSPD visualization. 

Figure 4 is a diagram of transcription and control of nra and nra-regulated 
genes. Nra exhibits negativ regulation (-) of its own xpression, that of two adjacent 



operons and of the mga regulator gene. Mga is a positiv regulator (+) of its own 
expression and that of nra. Promoters (p) and transcription terminators (ff) are shown 
in italics. For designation of ORFs, see Table 1. The sequences are drawn to scale. 

Figure 5 depicts attachment of Gas wild-type and nra mutant strains to 
immobilized human fibronectin and type I collagen. The bacteria were cultured on solid 
THY medium under anaerobic conditions until they reached stationary phase and were 
then harvested for binding assays. After FTIC labeling of the cells, adherent cells were 
detected by measuring the relative light units (RLU) present in each sample. 
Normalization of the values was performed as indicated below in the Examples section. 

Detailed Description of the Preferred Embodiments 

In accordance with the present invention, there is provided isolated collagen 
binding proteins from group A streptococcal bacteria, and their corresponding amino 
acid and nucleic acid sequences are described herein. Two specific proteins isolated in 
accordance with the present invention are designated Cpa1, having the nucleic acid 
sequence as shown in SEQ ID NO. 1 and the amino acid sequence of SEQ ID NO. 2, 
and Cpa49, which has the nucleic acid sequence as shown in SEQ ID NO. 3 and the 
amino acid sequence observed in SEQ ID No. 4. Using different experimental 
approaches, it has now been shown that Cpa1 and Cpa49 both bind to collagen, e.g., 
via binding of soluble 125-iodine labeled collagen, inhibition of binding to immobilized 
collagen by recombinant purified Cpa1 protein and by specific antisera directed to 
Cpa49 / Cpa1, and thus these prot ins or their antibodies can thus be useful in the 
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treatment and prevention of group A streptococcal disease, or in techniques to id ntrfy 
such prot ins, as described further below. It has also been det rmined via collag n 
binding experiments with recombinant purified Cpa-fragments, that the collagen binding 
domain can be deduced to reside in the third (C-terminal) quarter of the protein. 

In addition to the structures of Cpa1 and Cpa49 as shown in the amino acid 
sequences of SEQ ID NOS. 2 and 4, respectively, as would be recognized by one of 
ordinary skill in this art, modification and changes may be made in the structure of the 
peptides of the present invention and DNA segments which encode them and still 
obtain a functional molecule that encodes a protein or peptide with desirable 
characteristics. The amino scid changes may be achieved by changing the codons of 
the DNA sequence. For example, certain amino acids may be substituted for other 
amino acids in a protein structure without appreciable loss of interactive binding 
capacity with structures such as, for example, antigen-binding regions of antibodies or 
binding sites on substrate molecules. Since it is the interactive capacity and nature of 
a protein that defines that protein's biological functional activity, certain amino acid 
sequence substitutions can be made in a protein sequence, and, of course, its 
underlying DNA coding sequence, and nevertheless obtain a protein with like 
properties. It is thus contemplated by the inventors that various changes may be made 
in the peptide sequences of the disclosed compositions, or corresponding DNA 
sequences which encode said peptides without appreciable loss of their biological 
utility or activity. 
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In addition, amino acid substitutions are also possible without affecting th 
collagen binding ability of the isolated proteins of the invention, provided that the 
substitutions provide amino acids having sufficiently similar properties to the ones in 
the original sequences. 

Accordingly, acceptable amino acid substitutions are generally therefore based 
on the relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions which 
take various of the foregoing characteristics into consideration are well known to those 
of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucine. The isolated 
proteins of the present invention can be prepared in a number of suitable ways known 
in the art including typical chemical synthesis processes to prepare a sequence of 
polypeptides. 

The synthetic polypeptides of the invention can thus be prepared using the well 
known techniques of solid phase, liquid phase, or peptide condensation techniques, or 
any combination thereof, can include natural and unnatural amino acids. Amino acids 
used for peptide synthesis may be standard Boc (N*-amino protected 
NM-butyloxycarbonyl) amino acid resin with the standard deprotecting, neutralization, 
coupling and wash protocols of the original solid phase procedure of Merrifield (J. Am. 
Chem. Soc., 85:2149-2154, 1963), or the base-labile N"-amino protected 
9^iiorenylmethoxycarbonyl (Fmoc) amino acids first described by Carpino and Han {J. 
Org. Chem., 37:3403-3409, 1972). Both Fmoc and Boc N"-amino protected amino 
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acids can be obtained from Fluka, Bachem, Advanced Chemtech, Sigma, Cambridg 
Research Biochemical, Bachem, or Peninsula Labs or other chemical companies 
familiar to those who practice this art. In addition, the method of the invention can be 
used with other N a -protecting groups that are familiar to those skilled in this art. Solid 
phase peptide synthesis may be accomplished by techniques familiar to those in the art 
and provided, for example, in Stewart and Young, 1984, Solid Phase Synthesis . 
Second Edition, Pierce Chemical Co., Rockford, II; Fields and Noble, 1990, Int. J. Pept 
Protein Res. 35:161-214, or using automated synthesizers, such as sold by ABS. 
Thus, polypeptides of the invention may comprise D-amino acids, a combination of D- 
and L-amino acids, and various "designer* amino acids (e.g., 0-methyl amino acids, 
Ca-methyl amino acids, and Na-methyl amino acids, etc.) to convey special properties. 
Synthetic amino acids include ornithine for lysine, fluorophenylalanine for 
phenylalanine, and norieucine for leucine or isoleucine. Additionally, by assigning 
specific amino acids at specific coupling steps, a-helices, 0 turns, 0 sheets, y-tums, 
and cyclic peptides can be generated. 

In a further embodiment, subunits of peptides that confer useful chemical and 
structural properties will be chosen. For example, peptides comprising D-amino acids 
will be resistant to L-amino acid-specific proteases in vivo. In addition, the present 
invention envisions preparing peptides that have more well defined structural 
properties, and the use of peptidomimetics and peptidomimetic bonds, such as ester 
bonds, to prepare peptides with novel properties. In another embodiment, a peptide 
may be generated that incorporates a reduced peptide bond, i. ., Ri-CH2-NH-R 2l wh re 
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Ri and Ra are amin add residues or sequences. A reduced peptid bond may be 
introduced as a dipeptide subunit. Such a molecul would be resistant t peptid bond 
hydrolysis, e.g., protease activity. Such peptides would provide ligands with unique 
function and activity, such as extended half-lives in vivo due to resistance to metabolic 
breakdown or protease activity. It is also well known that in certain systems, 
constrained peptides show enhanced functional activity (Hruby, Life Sciences, 
31:189-199, 1982); (Hruby et at., Bkxhem J., 268:249-262, 1990). 

Also provided herein are sequences of nucleic acid molecules that selectively 
hybridize with nucleic acid molecules encoding the collagen-binding proteins of the 
invention, or portions thereof, such as consensus or variable sequence amino acid 
motifs, from Streptococcus pyogenes described herein or complementary sequences \ 
thereof. By "selective" or "selectively" is mean t a seq uence which does not hybridize 



with other nucleic acids. This is to promote specific detection of Cpa1 or Cpa49. 
Therefore, in the design of hybridizing nucleic acids, selectivity will depend upon the 
other components present in a sample. The hybridizing nucleic acid should have at 
least 70% complementarity with the segment of the nucleic acid to which it hybridizes. 
As used herein to describe nucleic acids, the term "selectively hybridizes" excludes the 
occasional randomly hybridizing nucleic acids, and thus, has the same meaning as 
"specifically hybridizing". The selectively hybridizing nucleic acids of the invention can 
have at least 70%, 80%, 85%, 90%, 95%, 97%, 98%, and 99% complementarity with 
the segment of the sequence to which they hybridize. 
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The invention contemplates sequences, probes and primers which selectiv ly 
hybridize to the encoding DNA or the complementary, or opposite, strand of DNA as 
those specifically provided herein. Specific hybridization with nucleic acid can occur 
with minor modifications or substitutions in the nucleic acid, so long as functional 
species-specific hybridization capability is maintained. By "probe" is meant nucleic 
acid sequences that can be used as probes or primers for selective hybridization with 
complementary nucleic acid sequences for their detection or amplification, which 
probes can vary in length from about 5 to 100 nucleotides, or preferably from about 10 
to 50 nucleotides, or most preferably about 18-24 nucleotides. Therefore, the terms 
"probe" or "probes" as used herein are defined to include "primers". Isolated nucleic 
acids are provided herein that selectively hybridize with the species-specific nucleic 
acids under stringent conditions and should have at least 5 nucleotides complementary 
to the sequence of interest as described by Sambrook ef a/., 1989. Molecular 
cloning: a laboratory manual, 2nd ed. Cold Spring Harbor Laboratory, Cold Spring 
Harbor, N.Y. 

If used as primers, the composition preferably includes at least two nucleic acid 
molecules which hybridize to different regions of the target molecule so as to amplify a 
desired region. Depending on the length of the probe or primer, the target region can 
range between 70% complementary bases and full complementarity and still hybridize 
under stringent conditions. For example, for the purpose of diagnosing the presence of 
the S. pyogenes, the degree of complementarity between the hybridizing nucleic acid 
(probe or primer) and the sequence to which it hybridizes ( .g., group A streptococcal 
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DNA from a sample) is at least enough to distinguish hybridization with a nucleic acid 
from other bacteria. 

The nucleic acid sequences encoding Cpa1 or Cpa49 proteins or portions 
thereof, such as consensus or variable sequence amino acid motifs, can be inserted 
into a vector, such as a plasmid, and recombinantly expressed in a living organism to 
produce recombinant Cpa1 or Cpa49 proteins or active fragments thereof. 

Recombinant proteins are produced by methods well known to those skilled in 
the art. A cloning vector, such as a plasmid or phage DNA is cleaved with a restriction 
enzyme, and the DNA sequence encoding the Cpa1 or Cpa49 protein or active 
fragments thereof, such as consensus or variable sequence amino acid motifs, is 
inserted into the cleavage site and ligated. The cloning vector is then inserted into a 
host to produce the protein or fragment encoded by the Cpa1 or Cpa49 encoding DNA. 
Suitable hosts include bacterial hosts such as Escherichia coli, Bacillus subtilis, yeasts 
and other cell cultures. Production and purification of the gene product may be 
achieved and enhanced using known molecular biology techniques. 

In accordance with the present invention, we have sequenced an 11.5 kb 
genomic fragment of serotype M49 GAS strain CS101 harboring the nra gene that is 
63% homologous to the rofA positive regulatory gene. In contrast to the apparent 
function of rofA, nra was found to encode a negative regulator affecting its own 
expression, the expression of two adjacent operons and several other genes. Some of 
these genes encode potentional intracellular proteins, whereas others encode surface 
prot ins such as the collagen-binding CPA (this study) and th fibronectin-binding 
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PrtF2 (Jaffe t al., 1996), which may be involved in virulence. In additi n, nra 
influences the expression of the mga regulatory gene and, thereby, th factors 
contained in the mga region. Expression of nra was found to be maximal in early 
stationary phase and was not significantly influenced by atmospheric conditions. 
Overall, the present invention includes the identification of a unique GAS negative 
regulator and implicates its function in a regulatory network affecting virulence factor 
expression in GAS, as set forth in detail in Podbielski et al., Molecular Microbiol. 
31 (4): 1051 -1064 (1999), incorporated herein by reference. 

In accordance with the present invention, an analysis was undertaken of the 
genomic region containing the nra gene. In this analysis, an 1 1 489 bp portion of the 
GAS chromosome was sequenced from a Lambda library of the serotype M49 GAS 
genome (GenBank accession no. U 49397). Computer analysis of this sequence 
revealed the present of nine complete and two partial predicted open reading frames 
(ORFs) (Fig. 1). Homology comparisons with GenBank entries demonstrated the 
similarity of 10 of the ORFs to known bacterial protein sequences (Table 1). Detailed 
analysis of the gene products encoded in this region (see the following sections) 
revealed the presence of a negative regulatory gene, nra, and immediately upstream in 
the opposite orientation, a collagen-binding protein, cpa. The genomes of GAS 
serotypes in GenBank and the available streptococcal serotype M1 genomic 
sequences (Roe et al., 1997) were searched for homologues to nra and cpa (Fig. 1). 
The gene sharing the highest degree of homology with nra was the positive regulatory 
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factor, rofA, whil cpa showed the highest homology t a gene for a fibronectin-binding 
protein, prtF. 

A more detailed computer analysis of the similarity between the negative 
regulator nra and the positive regulator rofA showed that both contain similar N- 
terminal double helix-tum-helix motifs (Fig. 1) whose intramolecular localization would 
be consistent with a negative or dual regulatory function of the proteins (Prag et al., 
1997). Homology between the collagen-binding cpa genes and the fibronectin-binding 
prtF genes was confined to the N4erminal sections and did not include the portions of 
prtF encoding its two fibronectin binding domains (Taley et al., 1994; Ozeri et al., 1996; 
Sela et al., 1993). The genes of fibronectin-binding proteins F have at least two 
isotypes, prtF (Hanski and Caparon, 1992) and sfb (Talay et al., 1992), which exhibit 
52% sequence homology. Similarly the genes of collagen-binding proteins, cpa, also 
appeared to have multiple forms such as cpa in M49 and cpa.1 in M1, which shared 
approximately 53% homology to each other and 23% homology to the prtF family of 
proteins. 

In order to confirm and extend the results of the sequence comparisons, 
oligonucleotides specific for prtF (Natanson et al., 1995), prtF2, cpa (M49/M1 ), nra and 
rofA genes (Table 4) were synthesized. These oligonucleotides were used as 
polymerase chain reaction (PCR) primers on genomic DNA from serotypes M1 , M2, M3, 
M4, M5, M6, M12, M18, M24 (Table 2) and eight independent M49 strains. In addition, 
the primers were used to generate probes for Southern blot hybridizations that were 
performed with EcoRI- and H/nd/0-digested genomic DNA of the 10 serotype strains 
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(Tabl 2). Based on the results from both analyses, no variation was found within the 
M49 serotype. However, different M protein serotype strains harbored ither rofA, nra 
or both genes. Any combination of regulator and binding protein (cpa, prtF, prtF2) 
could also be found. Therefore, the nrafcpa and rofA/prtF pairs are not mutually 
exclusive, and single strains can also contain any combination of regulators and 
binding proteins. What was particularly striking was that, although M49- and Mi- 
contained gene pairs had different regulatory proteins {cpa/nra and cpa. 1/rofA. 1 
respectively), the binding and regulatory genes were flanked by five genes sharing 
>98% homology and three genes with <50% homology that indicated that cpa and nra 
could be part of a pathogenicity island. In the serotype M49 strain used for further 
study, in addition to the cpa/nra gene pair, a prtF2 gene was contained in a separate 
location on the GAS chromosome. The localization of other regulator/binding protein 
pairs, especially in strains containing multiple regulators or binding proteins, awaits 
further analysis. 

The transcriptional organization of nra, cpa and flanking genes was determined 
by Northern blotting using PCR-generated specific probes (see Table 4 for primer 
sequences). Each Northern blot was repeated three or four times, and the results are 
given in Fig. 2. To determine the effect of nra on the transcription of itself and 
neighboring genes, an nra mutant was constructed by genomic insertion of the plasmid 
pFW11. The construct was confirmed by Southern blot hybridization and specific 
PCRs using nra mutant genomic DNA (data not shown). As transcription of rofA, the 
gene sharing the greatest homology to nra, is increased und r a robic conditions, th 
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Northern analyses were carried out on RNA isolated from cells grown und r both 
aerobic and anaerobic conditions. It should be noted that nra was transcribed at very 
low rates and was barely detectable in 80 ng of total RNA. 

The nra region was found to be monocistronically transcribed («1.8kb) and 
upregulated in an nra mutant. Transcription was slightly, although probably not 
significantly, induced under aerobic conditions (Fig. 2A). The three genes immediately 
downstream of nra, ORF5-nifR3L-kinL, were transcribed as an operon whose 2.6 kb 
transcript, as detected with a nifR3L probe, is shown in Fig. 2B. The ORF5-/onL operon 
was expressed at higher levels under aerobic conditions and in an nra mutant, 
suggesting that this operon falls under the control of nra. The different transcription 
rates of nUR3L in wild-type and nra mutant strains were confirmed by Northern blots 
performed on serial dilutions of total mRNA (Fig. 2B). Reverse transcriptase (RT)-PCR 
carried out on total mRNA using primers directed to the 3* end of nra and the 5' end of 
ORF5 yielded a product that would be present only if at least some transcriptional 
readthrough occurs between nra and ORF5 (data not shown). Thus, inverted repeats 
present in the non-coding section between nra and ORF5 serve only as a weak 
transcriptional terminator, allowing a small amount of readthrough between nra and 
ORF5. However, the majority of the nifR3L transcript originates from a second 
promoter upstream of ORF5, as only the ORF5-to7?L transcript could be visualized on 
the Northern blots. Because insertion of pFW1 1 in nra disrupted readthrough between 
nra and ORF5, the only promoter still present in the nra mutants was th promot r 
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ahead of 0RF5. As the ORF5-tfnL product was still increased in the nra mutants, it 
indicates that nra also has a negativ regulatory effect at the promoter immediat ly 
upstream of ORF5. 

Northern analyses using a cpa probe detected a 5.2 kb transcript composed of 
the four genes (qpa-ORF2) located immediately upstream of and in the opposite 
orientation to nra (Fig. 2B). Transcription of the cpa operon was also increased in an 
nra mutant, suggesting its regulation by nra. However, unlike the nra and ORF5-WnL 
transcripts, the cpa-ORF2 transcript was more abundant under anaerobic conditions, 
suggesting a possible superimposed second regulatory mechanism for this operon. 

Northern blots using a prtF2 probe detected an mRNA consistent in size with a 
monocistronic transcription of prtF2 (Fig. 2C). Although the gene is located at a distant 
site in the chromosome, increased transcription of an nra mutant was detected, and its 
expression is increased under aerobic conditions. However, the effects of nra mutation 
did not generally influence mRNA transcription rate or stability, as the recA transcript 
was not affected in the nra mutant (data not shown). 

As nra appeared to be a global negative regulator of virulence factors. Northern 
blots were used to determine whether nra and the global positive virulence factor 
regulator mga (Fig. 3) affected each other. Levels of mga mRNA were increased in the 
nra mutant (Podbielski et al., 1995) for Northern blot analysis, the nra message was 
found to be decreased in the mga mutant, which led to a corresponding increase in the 
nifR3L and cpa transcripts that are negativ ly regulated by nra (Fig. 3). 
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Taken together, the data from the different transcript analyses indicate that the 
nra gene product is a negativ regulator of its own expression and the two adjacent 
operons as well as of prtF2 and mga (Fig. 4). The mga regulator, in turn, was 
suggested to be a positive regulator of nra expression and, thus, an indirect suppressor 
of ma-dependent genes (Fig. 4). 

With regard to the gene coding for the collagen-binding region of the group A 
streptococci, the cpa gene was demonstrated to be negatively regulated by the nra 
gene product To determine whether CPA was involved in matrix molecule interactions, 
a recombinant CPA-maltose binding protein fusion was expressed in Escherichia coli. 
After purification and labeling, it was subjected to an enzyme-linked binding assay with 
the immobilized human matrix proteins, collagen type 1, fibronectin and laminin. Using 
the purified maltose-binding protein as a negative control, the C pa-fusion protein bound 
significantly to collagen and, to a lesser extent, to laminin (P<0.05 as determined by the 
Wilcoxon range test) (Table 3). Binding of Cpa to fibronectin and BSA remained at the 
level of the maltose-binding protein alone. Thus, like protein F2, Cpa is a second nra- 
controlled, potential GAS surface protein, exhibiting human matrix protein-binding 
properties. 

The regulation of these binding proteins by nra would predict that stationary 
phase M49 nra mutants may still contain Cpa and protein F2, as they continue to 
transcribe cpa and prtF2 upon entry into stationary phase. This could result in better 
fibronectin and collagen binding by stationary phase nra mutants. To t st this 
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predicti n, M49 wild type and nra mutant strains were cultured on plates under 
anaerobic conditions until stationary phase was reached. The cells were harvested, 
fluorescein isothiocyanate (FITC) labeled and the binding of the two strains to 
immobilized collagen and fibronectin was measured. The nra mutant exhibits 
significantly increased binding to both matrix proteins compared with the wild type (Fig. 
5). Collagen-binding assays conducted with unmarked cells that were detected with 
labeled polyclonal serum yielded similar results (data not shown), suggesting that the 
FITC-labeling protocol did not damage the cells or alter binding significantly. As 
recombinant Cpa was found to block the binding of FITC-labeled GAS to immobilized 
collagen (data not shown), the binding of cells to collagen is probably mediated through 
the interaction of Cpa and collagen. Overall, these data indicate that, while wild-type 
bacteria could decrease their affinity to matrix proteins when entering stationary growth 
phase, the nra mutants no longer had this ability. 

The organization of the genomic regions controlled by nra were remarkably 
similar to those flanking rofA (Fig. 1 ). The five downstream genes were more than 98% 
homologous. The upstream four-gene operon structure was conserved for both 
regulators. However, the homology of these genes was only 43-52% across serotypes. 
In rofA-containing M6, the first gene upstream was the fibronectin-binding protein gene, 
prtF. In the roM-containing serotype M1 and the nra-containing serotype M49, the first 
gene of the upstream operon consisted of a novel gene, cpa. Protein purification and 
binding studies showed that cpa encoded a collagen-binding protein that was unable to 
bind fibronectin. Further PCR and Southern hybridization analysis of other GAS M 
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serotypes confirmed that there was no correlation between the regulat r (nra/rofA) and 
the binding prot in contained in the upstream operon tprtF/cpa). In addition, strains 
were found that contained both regulators and/or multiple binding proteins. For 
example, serotype M49 contained an nra/cpa pair. However, a prtF2 gene located 
Isewhere in the chromosome was monocistronically transcribed and still negatively 
regulated by nra. The presence of both the positive rofA regulator and the negative nra 
regulator in the serotype M5 and the presence of only rofA in serotype M6 may explain 
the influences of genomic background noted during studies of RofA regulation in these 
serotypes (Van Heyningen et al., 1993; Fogg and Caparon, 1997). 

The expression of nra during growth was followed using a luciferase reporter 
gene fused to the 3' end of nra. The high-sensitivity detection of luciferase activity by a 
luminometer coupled with the 10 min half-life of luciferase in GAS (unpublished results) 
allowed the analysis of /uc-fusion activity even at low cell densities, nra was 
transcribed at the highest rate during early stationary phase and was not significantly 
influenced by atmospheric conditions. This was in contrast to rofA, which has been 
described as being maximally active under aerobic conditions (Fogg and Caparon, 
1997). The differences in these results could reflect either differences in sensor 
capacity between rofA and nra or a methodological difference in the assay methods 
used. The rofA measurements were done by determining the level of an accumulated 
stable 0-galactosidase reporter from a multicopy plasm id obtained using the 
experimental procedures described in the examples below. 
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In addition to the Cpa prot ins abov in various procedures, including th 
detection of the presence of Cpa1 r Cpa49 or their antibodies, the present inv ntion 
also contemplates the use of the nucleic acids described herein to detect and identify 
the presence of collagen-binding GAS as well. The methods are useful for diagnosing 
group A streptococcal infections and other streptococcal diseases such as may occur 
in catheter related infections, biomaterial related infections, respiratory tract infections, 
cardiac, gastrointestinal or central nervous system infections, ocular infections, wound 
infections, skin infections, and a myriad of other diseases including conjunctivitis, 
keratitis, cellulitis, myositis, septic arthritis, osteomyelitis, bovine mastitis, and canine 
pyoderma, all as affected by group A streptococcal bacteria. 

In accordance with the invention, a preferred method of detecting the presence 
of Cpa1 or Cpa49 proteins involves the steps of obtaining a sample suspected of 
containing group A streptococci. The sample may be taken from an individual, for 
example, from one's blood, saliva, tissues, bone, muscle, cartilage, or skin. The cells 
can then be lysed, and the DNA extracted, precipitated and amplified. Detection of 
DNA from group A streptococci can be achieved by hybridizing the amplified DNA with 
a probe for GAS that selectively hybridizes with the DNA as described above. 
Detection of hybridization is indicative of the presence of group A streptococci. 

Preferably, detection of nucleic acid (e.g. probes or primers) hybridization can 
be facilitated by the use of detectable moieties. For example, the probes can be 
labeled with biotin and used in a streptavidin-coated microtiter plate assay. Other 
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detectable moieties include radioactiv labeling, nzyme labeling, and fluorescent 
labeling, for exampl . 

DNA may be detected directly or may be amplified enzymatically using 
polymerase chain reaction (PCR) or other amplification techniques prior to analysis. 
RNA or cDNA can be similarly detected. Increased or decrease expression of Cpa1 or 
Cpa49 can be measured using any of the methods well known in the art for the 
quantification of nucleic acid molecules, such as, for example, amplification, PCR, RT- 
PCR, RNase protection, Northern blotting, and other hybridization methods. 

Diagnostic assays for Cpa1 or Cpa49 proteins or active portions thereof, such as 
consensus or variable sequence amino acid motifs, or anti-Cpal or Cpa49 antibodies 
may also be used to detect the presence of a streptococcal bacterium such as 
Streptococcus pyogenes. Assay techniques for determining protein or antibody levels 
in a sample are well known to those skilled in the art and include methods such as 
radioimmunoasssay, Western blot analysis and ELISA assays. 

The isolated, recombinant or synthetic proteins of the present invention, or 
antigenic portions thereof (including epitope-bearing fragments), or fusion proteins 
including the Cpa1 or Cpa49 proteins as described above, can be administered to 
animals as immunogens or antigens, alone or in combination with an adjuvant, for the 
production of antibodies reactive with Cpa1 or Cpa49 proteins or portions thereof. In 
addition, the proteins can be used to screen antibodies or antisera for hyperimmune 
patients from whom can be derived specific antibodies having a very high affinity for the 
proteins. 
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Antibodies to Cpa1 or Cpa49, or to fragments thereof, can also be used in 
accordance with the invention for the specific d tection of collag n-binding 
streptococcal proteins, for the prevention of infection from group A streptococci, for the 
treatment of an ongoing infection, or for use as research tools. The term "antibodies" 
as used herein includes monoclonal, polyclonal, chimeric, single chain, bispectfic, 
simianized, and humanized or primatized antibodies as well as Fab fragments, 
including the products of an Fab immunoglobulin expression library. Generation of any 
of these types of antibodies or antibody fragments is well known to those skilled in the 
art In the present case, specific polyclonal antiserum against Cpa has been generated 
which reacts with Cpa in Western immunoblots and ELISA assays and interferes with 
Cpa binding to collagen. The antiserum can be used for specific agglutination assays to 
detect bacteria which express Cpa on their surface. The antiserum does not cross-react 
with bacteria which express the fibronectin-binding protein F1 on their surface, 
although a portion of protein F1 exhibits sequence homologies to Cpa1 and Cpa49. 

Any of the above described antibodies may be labeled directly with a detectable 
label for identification and quantification of group A streptococci. Labels for use in 
immunoassays are generally known to those skilled in the art and include enzymes, 
radioisotopes, and fluorescent, luminescent and chromogenic substances, including 
colored particles such as colloidal gold or latex beads. Suitable immunoassays include 
enzyme-linked immunosorbent assays (ELISA). 

Alternatively, the antibody may be labeled indirectly by reaction with labeled 
substances that hav an affinity for immunoglobulin. The antibody may be conjugated 
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with a second substance and detected with a labeled third substance having an affinity 
for the second substance conjugated to the antibody. For exampl , the antibody may 
be conjugated to biotin and the antibody-biotin conjugate detected using labeled avidin 
or streptavidin. Similarly, the antibody may be conjugated to a hapten and the 
antibody-hapten conjugate detected using labeled anti-hapten antibody. These and 
other methods of labeling antibodies and assay conjugates are well known to those 
skilled in the art. 

Antibodies to the collagen-binding proteins Cpa1 or Cpa49, or portions thereof, 
may also be used in production facilities or laboratories to isolate additional quantities 
of the proteins, such as by affinity chromatography. For example, antibodies to the 
collagen-binding protein Cpa1 or Cpa49 may also be used to isolate additional 
amounts of collagen. 

The isolated proteins of the present invention, or active fragments thereof, and 
antibodies to the proteins may be useful for the treatment and diagnosis of group A 
streptococcal bacterial infections as described above, or for the development of anti- 
group A streptococcal vaccines for active or passive immunization. Further, when 
administered as pharmaceutical composition to a wound or used to coat medical 
devices or polymeric biomaterials in vitro and in vivo, both the proteins and the 
antibodies are useful as blocking agents to prevent or inhibit the binding of group A 
streptococci to the wound site or the biomaterials themselves. Preferably, the antibody 
is modified so that it is less immunogenic in the patient to whom it is administered. For 
example, if the patient is a human, the antibody may be "humanized" by transplanting 
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the complimentarity determining regions of the hybndorna-derived antibody into a 
human monoclonal antibody as described, .g., by Jones t a/., Nature 321:522-625 
(1986) or Tempest et al. Biotechnology 9:266-273 (1991 ). 

Medical devices or polymeric biomaterials to be coated with the antibodies, 
proteins and active fragments described herein include, but are not limited to, staples, 
sutures, replacement heart valves, cardiac assist devices, hard and soft contact lenses, 
intraocular lens implants (anterior chamber or posterior chamber), other implants such 
as corneal inlays, kerato-prostheses, vascular stents, epikeratophalia devices, 
glaucoma shunts, retinal staples, scleral buckles, dental prostheses, thyroplastic 
\ devices, laryngoplastic devices, vascular grafts, soft and hard tissue prostheses 
including, but not limited to, pumps, electrical devices including stimulators and 
recorders, auditory prostheses, pacemakers, artificial larynx, dental implants, mammary 
implants, penile implants, cranio/facial tendons, artificial joints, tendons, ligaments, 
menisci, and disks, artificial bones, artificial organs including artificial pancreas, 
artificial hearts, artificial limbs, and heart valves; stents, wires, guide wires, intravenous 
and central venous catheters, laser and balloon angioplasty devices, vascular and 
heart devices (tubes, catheters, balloons), ventricular assists, blood dialysis 
components, blood oxygenators, urethra l/ureteral/urinary devices (Foley catheters, 
stents, tubes and balloons), airway catheters (endotracheal and tracheostomy tubes 
and cuffs), enteral feeding tubes (including nasogastric, intragastric and jejunal tubes), 
wound drainage tubes, tubes used to drain the body cavities such as the pleural, 
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peritoneal, cranial, and pericardial cavities, blood bags, test tubes, blood collection 
tubes, vacutainers, syringes, needles, pipettes, pipett tips, and blood tubing. 

It will be understood by those skilled in the art that the term "coated" or "coating", 
as used herein, means to apply the protein, antibody, or active fragment to a surface of 
the device, preferably an outer surface that would be exposed to streptococcal bacterial 
infection. The surface of the device need not be entirely covered by the protein, 
antibody or active fragment 

In addition, the present invention may be utilized as immunological 
compositions, including vaccines, and other pharmaceutical compositions containing 
the Cpa1 or Cpa49 proteins or portions thereof are included within the scope of the 
present invention. Either one or both of the Cpa1 or Cpa49 proteins, or active or 
antigenic fragments thereof, or fusion proteins thereof, can be formulated and 
packaged, alone or in combination with other antigens, using methods and materials 
known to those skilled in the art for vaccines. The immunological response may be 
used therapeutically or prophylactically and may provide antibody immunity or cellular 
immunity, such as that produced by T lymphocytes. 

The immunological compositions, such as vaccines, and other pharmaceutical 
compositions can be used alone or in combination with other blocking agents to protect 
against human and animal infections caused by or exacerbated by group A 
streptococci. In particular, the compositions can be used to protect humans against 
skin infections such as impetigo and eczema, as well as mucous membrane infections 
such as tonsillopharyngitis. In addition, effectiv amounts of the compositions of th 
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present invention may be used to protect against complications caused by localized 
infections such as sinusitis, mastoiditis, parapharygeal abscesses, cellulitis, necrotizing 
fascitis, myositis, streptococcal toxic shock syndrome, pneumonitis endocarditis, 
meningitis, osteomylitis, and many other sever diseases. Further, the present 
compositions can be used to protect against nonsuppurative conditions such as acute 
rheumatic fever, acute glomerulonephritis, obsessive/compulsive neurologic disorders 
and exacerbations of forms of psoriasis such as psoriasis vulgaris. The compositions 
may also be useful as appropriate in protecting both humans and other species of 
animals where needed to combat similar group A streptococcal infections. 

To enhance immunogenicity, the proteins may be conjugated to a carrier 
molecule. Suitable immunogenic carriers include proteins, polypeptides or peptides 
such as albumin, hemocyanin, thyroglobulin and derivatives thereof, particularly bovine 
serum albumin (BSA) and keyhole limpet hemocyanin (KLH), polysaccharides, 
carbohydrates, polymers, and solid phases. Other protein derived or non-protein 
derived substances are known to those skilled in the art. An immunogenic carrier 
typically has a molecular weight of at least 1,000 Daltons, preferably greater than 
10,000 Daltons. Carrier molecules often contain a reactive group to facilitate covalent 
conjugation to the hapten. The carboxylic acid group or amine group of amino acids or 
the sugar groups of glycoproteins are often used in this manner. Carriers lacking such 
groups can often be reacted with an appropriate chemical to produce them. Preferably, 
an immune response is produced when the immunogen is injected into animals such as 
mice, rabbits, rats, goats, sheep, guinea pigs, chickens, and other animals, most 
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preferably mice and rabbits. Altemativ ly, a multipl antigenic peptid comprising 
multipl copies of the protein or polypeptide, or an antigenically or immunologically 
equivalent polypeptide may be sufficiently antigenic to improve immunogenicity without 
the use of a carrier. 

The Cpa1 or Cpa49 proteins or portions thereof, or combination of proteins, may 
be administered with an adjuvant in an amount effective to enhance the immunogenic 
response against the conjugate. At this time, the only adjuvant widely used in humans 
has been alum (aluminum phosphate or aluminum hydroxide). Saponin and its purified 
component Quil A, Freund's complete adjuvant and other adjuvants used in research 
and veterinary applications have toxicities which limit their potential use in human 
vaccines. However, chemically defined preparations such as muramyl dipeptide, 
monophosphoryl lipid A, phospholipid conjugates such as those described by 
Goodman-Snitkoff et al. J. Immunol. 147:410-415 (1991) and incorporated by reference 
herein, encapsulation of the conjugate within a proteoliposome as described by Miller 
et al., J. Exp. Med. 176:1739-1744 (1992) and incorporated by reference herein, and 
encapsulation of the protein in lipid vesicles such as Novasome™ lipid vesicles (Micro 
Vascular Systems, Inc., Nashua, NH) may also be useful. 

The term "vaccine" as used herein includes DNA vaccines in which the nucleic 
acid molecule encoding for a collagen-binding Gas protein, such as the nucleic acid 
sequences disclosed herein as SEQ ID NOS. 1 or 3, as used in a pharmaceutical 
composition is administered to a patient. For genetic immunization, suitable delivery 
methods known to those skilled in the art include direct injection of plasmid DNA into 
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muscles (W metal., Hum. Mol. Genet. 1:363, 1992), delivery of DNA complexed with 
specific protein carriers (Wu etal.,J. Biol. Chem. 264:16985, 1989), coprecipitation of 
DNA with calcium phosphate (Benvenisty and Reshef, Proc. Natl. Acad. Sti. 83:9551, 
1986), encapsulation of DNA in liposomes (Kaneda ef al., Science 243:375, 1989), 
particle bombardment (Tang ef al., Nature 356:152, 1992 and Eisenbraun et al., DNA 
Cell Biol. 12:791, 1993), and in vivo infection using cloned retroviral vectors (Seeger et 
al., Proc. Natl. Acad. Sti. 81:5849, 1984). 

In another embodiment, the invention is a polynucleotide which comprises 
contiguous nucleic acid sequences capable of being expressed to produce a gene 
5 product upon introduction of said polynucleotide into eukaryotic tissues in vivo. The 
encoded gene product preferably either acts as an immunostimulant or as an antigen 
capable of generating an immune response. Thus, the nucleic acid sequences in this 
embodiment encode an immunogenic epitope, and optionally a cytokine or a T-cell 
costimulatory element, such as a member of the B7 family of proteins. 

There are several advantages of immunization with a gene rather than its gene 
product. The first is the relative simplicity with which native or nearly native antigen can 
be presented to the immune system. Mammalian proteins expressed recombinantly in 
bacteria, yeast, or even mammalian cells often require extensive treatment to ensure 
appropriate antigenicity. A second advantage of DNA immunization is the potential for 
the immunogen to enter the MHC class I pathway and evoke a cytotoxic T cell 
response. Immunization of mice with DNA encoding the influenza A nucleoprotein (NP) 
licited a CD8* response to NP that protected mice against chall ng with h t rologous 
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strains of flu. (See Montgomery, D. L t a/., Cell Mol Biol, 43(3):285-92, 1997 and 
Ulmer, J. era/., Vaccine, 15(8):792-794, 1997.) 

Cell-mediated immunity is important in controlling infection. Since DNA 
immunization can evoke both humoral and cell-mediated immune responses, its 
greatest advantage may be that it provides a relatively simple method to survey a large 
number of S. pyogenes genes for their vaccine potential. 

Pharmaceutical compositions containing the Cpa1 or Cpa49 proteins or portions 
thereof, nucleic acid molecules, antibodies, or fragments thereof, may be formulated in 
combination with a pharmaceutical excipient or carrier such as saline, dextrose, water, 
glycerol, ethanol, other therapeutic compounds, and combinations thereof. The 
formulation should be appropriate for the mode of administration. The compositions 
are useful for interfering with, modulating, or inhibiting binding interactions between 
streptococcal bacteria and collagen on host cells. 

The amount of expressible DNA or transcribed RNA to be introduced into a 
vaccine recipient will have a very broad dosage range and may depend on the strength 
of the transcriptional and translational promoters used. In addition, the magnitude of 
the immune response may depend on the level of protein expression and on the 
immunogenicity of the expressed gene product. In general, effective dose ranges of 
about 1 ng to 5 mg, 100 ng to 2.5 mg, 1 jig to 750 ng, and preferably about 10 jig to 
300 pig of DNA is administered directly into muscle tissue. Subcutaneous injection, 
intradermal introduction, impression through the skin, and other modes of 
administration such as intraperitoneal, intravenous, or inhalation deliv ry are also 
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suitabl . K is also contemplated that booster vaccinations may be provided. F Mowing 
vaccination with a polynucleotide immunogen, boosting with prot in immunogens such 
as the Cpa1 or Cpa49 gene product is also contemplated. 

The polynucleotide may be "naked", that is, unassociated with any proteins, 
adjuvants or other agents which affect the recipient's immune system. In this case, it is 
desirable for the polynucleotide to be in a physiologically acceptable solution, such as, 
but not limited to, sterile saline or sterile buffered saline. Alternatively, the DNA may be 
associated with liposomes, such as lecithin liposomes or other liposomes known in the 
art, as a DNA-liposome mixture, or the DNA may be associated with an adjuvant known 
in the art to boost immune responses, such as a protein or other carrier. Agents which 
assist in the cellular uptake of DNA, such as, but not limited to, calcium ions, may also 
be used. These agents are generally referred to herein as transf action facilitating 
reagents and pharmaceutically acceptable carriers. Techniques for coating 
microprojectiles coated with polynucleotide are known in the art and are also useful in 
connection with this invention. For DNA intended for human use it may be useful to 
have the final DNA product in a pharmaceutically acceptable carrier or buffer solution. 
Pharmaceutically acceptable carriers or buffer solutions are known in the art and 
include those described in a variety of texts such as Remington's Pharmaceutical 
Sciences. 

It is recognized by those skilled in the art that an optimal dosing schedule for a 
DNA vaccination regimen may include as many as five to six, but preferably three to 
fiv , or even more preferably one to three administrations of the immunizing ntity 
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given at intervals of as few as two t four weeks, t as long as fiv to t n years, or 
occasionally at even longer intervals. 

Suitable methods of administration of any pharmaceutical composition disclosed 
in this application include, but are not limited to, topical, oral, anal, vaginal, 
intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal and intradermal 
administration. 

For topical administration, the composition is formulated in the form of an 
intment, cream, gel, lotion, drops (such as eye drops and ear drops), or solution (such 
as mouthwash). Wound or surgical dressings, sutures and aerosols may be 
impregnated with the composition. The composition may contain conventional 
additives, such as preservatives, solvents to promote penetration, and emollients. 
Topical formulations may also contain conventional carriers such as cream or ointment 
bases, ethanol, or oleyl alcohol. 

In a preferred embodiment, a vaccine is packaged in a single dosage for 
immunization by parenteral (i.e., intramuscular, intradermal or subcutaneous) 
administration or nasopharyngeal (i.e., intranasal) administration. The vaccine is most 
preferably injected intramuscularly into the deltoid muscle. The vaccine is preferably 
combined with a pharmaceutically acceptable carrier to facilitate administration. The 
carrier is usually water or a buffered saline, with or without a preservative. The vaccine 
may be lyophilized for resuspension at the time of administration or in solution. 

Microencapsulation of the protein will give a controlled release. A number of 
factors contribut to the selection of a particular polymer for microencapsulation. Th 

34 



reproducibility of polymer synthesis and the microencapsulation process, th cost of th 
microencapsulation materials and process, the toxicological profil , th requirem nts 
for variable release kinetics and the physicochemical compatibility of the polymer and 
the antigens are all factors that must be considered. Examples of useful polymers are 
polycarbonates, polyesters, polyurethanes, polyorthoesters, polyamides, poly (D,L- 
lactide-co-glycolide) (PLGA) and other biodegradable polymers. The use of PLGA for 
the controlled release of antigen is reviewed by Eldridge et al., Current Topics in 
Microbiology and Immunology, 146:59-66(1989). 

The preferred dose for human administration is from 0.01 mg/kg to 10 mg/kg, 
preferably approximately 1 mg/kg. Based on this range, equivalent dosages for heavier 
body weights can be determined. The dose should be adjusted to suit the individual to 
whom the composition is administered and will vary with age, weight and metabolism of 
the individual. The vaccine may additionally contain stabilizers or pharmaceutical ly 
acceptable preservatives, such as thimerosal (ethyl(2-mercaptobenzoate-S)mercury 
sodium salt) (Sigma Chemical Company, St. Louis, MO). 

When labeled with a detectable biomolecule or chemical, the collagen-binding 
proteins described herein are useful for purposes such as in vivo and in vitro diagnosis 
of streptococcal infections or detection of group A streptococcal bacteria. Laboratory 
research may also be facilitated through use of such protein-label conjugates. Various 
types of labels and methods of conjugating the labels to the proteins are well known to 
those skilled in the art. Several specific labels are set forth below. The labels are 
particularly useful when conjugated to a prot in such as an antibody or receptor. For 
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exampl , the protein can be conjugated to a radi label such as, but not restricted to, 
"P, 3 H, 14 C, as S, 12S I, or 131 l. Detection of a label can be by methods such as scintillation 
counting, gamma ray spectrometry or autoradiography. 

Bioluminescent labels, such as derivatives of firefly lucrferin, are also useful. 
The bioluminescent substance is covalently bound to the protein by conventional 
methods, and the labeled protein is detected when an enzyme, such as luciferase, 
catalyzes a reaction with ATP causing the bioluminescent molecule to emit photons of 
light Fluorogens may also be used to label proteins. Examples of fluorogens include 
fluorescein and derivatives, phycoerythrin, allo-phycocyanin, phycocyanin, rhodamine, 
and Texas Red. The fluorogens are generally detected by a fluorescence detector. 

The protein can alternatively be labeled with a chromogen to provide an enzyme 
or affinity label. For example, the protein can be biotinylated so that it can be utilized in 
a biotin-avidin reaction, which may also be coupled to a label such as an enzyme or 
fluorogen. For example, the protein can be labeled with peroxidase, alkaline 
phosphatase or other enzymes giving a chrornogenic or fluorogenic reaction upon 
addition of substrate. Additives such as 5-amino-2,3-dihydro-1,4-phthalazinedione 
(also known as Luminol') (Sigma Chemical Company, St. Louis, MO) and rate 
enhancers such as p-hydroxybiphenyl (also known as p-pheny I phenol) (Sigma 
Chemical Company, St Louis, MO) can be used to amplify enzymes such as 
horseradish peroxidase through a luminescent reaction; and luminogeneic or 
fluorogenic dioxetane derivatives of enzyme substrates can also be used. Such labels 
can be detected using enzyme-linked immunoassays (ELISA) or by detecting a color 
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change with the aid of a spectrophotometer. In addition, prot ins may be labeled with 
coll idal gold for use in immunoelectron microscopy in accordance with m thods well 
known to those skilled in the art. 

The location of a ligand in cells can be determined by labeling an antibody as 
described above and detecting the label in accordance with methods well known to 
those skilled in the art, such as immunofluorescence microscopy using procedures 
such as those described by Warren and Nelson (Mol. Cell. Biol., 7: 1326-1337, 1987). 

In addition to the therapeutic compositions and methods described above, the 
Cpa1 and Cpa49 proteins or active portions or fragments thereof, nucleic acid 
molecules or antibodies are useful for interfering with the initial physical interaction 
between a pathogen and mammalian host responsible for infection, such as the 
adhesion of bacteria, to mammalian extracellular matrix proteins such as collagen on 
in-dwelling devices or to extracellular matrix proteins in wounds; to block Cpa1 or 
Cpa49 protein-mediated mammalian cell invasion; to block bacterial adhesion between 
collagen and bacterial Cpa1 or Cpa49 proteins or portions thereof that mediate tissue 
damage; and, to block the normal progression of pathogenesis in infections initiated 
other than by the implantation of in-dwelling devices or surgical techniques. 

The Cpa1 or Cpa49 proteins, or active fragments thereof, are useful in a method 
for screening compounds to identify compounds that inhibit collagen binding of 
streptococci to host molecules. In accordance with the method, the compound of 
interest is combined with one or more of the Cpa1 or Cpa49 proteins or fragments 
thereof and the degree of binding of the prot in to collag noroth r xtracellular matrix 
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proteins is measured or observed. If the presence of the compound results in th 
inhibition of protein-collagen binding, for example, then the compound may be useful 
for inhibiting group A streptococci in vivo or in vitro. The method could similarly be 
used to identify compounds that promote interactions of GAS with host molecules. The 
method is particularly useful for identifying compounds having bacteriostatic or 
bacteriocidal properties. 

For example, to screen for GAS agonists or antagonists, a synthetic reaction 
mixture, a cellular compartment (such as a membrane, cell envelope or cell wall) 
containing one or more of the Cpa1 or Cpa49 proteins or fragments thereof and a 
labeled substrate or ligand of the protein is incubated in the absence or the presence of 
a compound under investigation. The ability of the compound to agonize or antagonize 
the protein is shown by a decrease in the binding of the labeled ligand or decreased 
production of substrate product. Compounds that bind well and increase the rate of 
product formation from substrate are agonists. Detection of the rate or level of 
production of product from substrate may be enhanced by use of a reporter system, 
such as a colon metric labeled substrate converted to product, a reporter gene that is 
responsive to changes in Cpa1 or Cpa49 nucleic acid or protein activity, and binding 
assays known to those skilled in the art. Competitive inhibition assays can also be 
used. 

Potential antagonists include small organic molecules, peptides, polypeptides 
and antibodies that bind to Cpa1 or Cpa49 nucleic acid molecules or proteins or 
portions thereof and thereby inhibit their activity or bind to a binding molecul (such as 
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collagen to prevent the binding of the Cpa1 or Cpa49 nucl ic acid molecules r 
proteins to its ligand. For exampl , a compound that inhibits Cpa1 or Cpa49 activity 
may be a small molecule that binds to and occupies the binding site of the Cpa1 or 
Cpa49 protein, thereby preventing binding to cellular binding molecules, to prevent 
normal biological activity. Examples of small molecules include, but are not limited to, 
small organic molecule, peptides or peptide-Jike molecules. Other potential antagonists 
include antisense molecules. Preferred antagonists include compounds related to and 
variants or derivatives of the Cpa1 or Cpa49 proteins or portions thereof. The nucleic 
acid molecules described herein may also be used to screen compounds for 
antibacterial activity. 

The invention further contemplates a kit containing one or more Cpa1 or Cpa49- 
specific nucleic acid probes, which can be used for the detection of collagen-binding 
proteins from group A streptococci in a sample, or for the diagnosis of GAS bacterial 
infections. Such a kit can also contain the appropriate reagents for hybridizing the 
probe to the sample and detecting bound probe. In an alternative embodiment, the kit 
contains antibodies specific to either or both Cpa1 and Cpa49 proteins or active 
portions thereof which can be used for the detection of group A streptococci. 

In yet another embodiment, the kit contains either or both the Cpa1 and Cpa49 
proteins, or active fragments thereof, which can be used for the detection of GAS 
bacteria or for the presence of antibodies to collagen-binding GAS proteins in a 
sample. The kits described herein may additionally contain equipment for safely 
obtaining the sampl , a vessel f r containing the reagents, a timing means, a buffer for 
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diluting the sampl , and a colorimet r, reflectometer, or standard against which a col r 
change may be measured. 

In a preferred embodiment, the reagents, including the protein or antibody, are 
lyophilized, most preferably in a single vessel. Addition of aqueous sample to the 
vessel results in solubilization of the lyophilized reagents, causing them to react. Most 
preferably, the reagents are sequentially lyophilized in a single container, in 
accordance with methods well known to those skilled in the art that minimize reaction 
by the reagents prior to addition of the sample. 
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Table 1. Sequence homologies of the ORFs of the GAS nra genomic region. 



GASORF 




Percentage 




(provisional) 


Homologous protein sequence; 


Reference . 


number/designation 


source organism 


Identity/similarity 


1 (msmRL) 


Multiple sugar metabolism regulator; 
Streptococcus mutans 


34/59 


Russell er a/. (1992) 


2 (ORF2) 


no homologous sequence identified 








c-ter minus of electron transler fiavoprote»o la; 


27/47 


Chen and Swenson (1994) 




MethytophHus methyiotrophus 




Gregg of at 0996) 


4 {iepAL) 


Signal peptidase I; 
Staphylococcus aureus 


46/67 


cpi 


Protein F; 


26/41 


HansKi and Caparon (1992) 




Streptococcus pyogenes 




Fogg etai. (1994) 


nrai 


RofA regulator of protein F; 


63/73 




Streptococcus pyogenes 




Ogasawara er at. (1994) 


5{0RF5) 


Hypothetical 31.8kDa protein in ttsH-cysK intergenic region; 


35/62 




Bacillus subtitis 






i" '1 


Nitrogenase regulator; 


32/46 


Machado etal. (1995) 


Azospifilhtm brasifense 








Hypothetical 37.1 kDa protein: 


59/75 


Ogasawara otai. (1994) 




Bacillus subtitis 






7;;<|w>l) 


dA/dG-kinase: 


57/74 


Ma er a/. (1995) 




Lactobacillus acidophilus 








Single-strand DNA-toindtrig protein; 


50/65 


Rikke era/. (1995) 




Bacillus subtlfc 






$(phcTL) 


PhenylalanyMANA synthase beta subunit; 


49/62 


Brakhage etal. (1990) 



Badttus subtitis 
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Table % Presence of nrt/rofA(-a$soaated) 
genes in selected GAS serotype strains. 



I iJ! 
i. 'l 



Serotype strain 


Regulatory genes 




Structural genes 






rvfA 


pnf 


• 

prtF2 




M1 




+ 






+ 


M2 












M3 






i- 






M4 


+ 


+ 




+ 




MS 




+ 








M6 






+ 






M12 






-** 


+ 




M18 




+ 




+ 




M24 












M49 


+ 






+ 


+ 



Genes were detected with specific probes used for genomic Southern bk>t hybridizations as 
wed as by specific PCR assays. Sequences of primers used for analytical PCRs or to generate 
probes are shown in Table 4. 

+. hybridtzation/PCR product detectable: -. no hybrtdization/PCR product detectable. 



Table 3. Human matrix protein-binding activity — — — — ______ _ 

of a recombinant Cpa protein. Collagen F&ronectm Laminin 



BSA 



Cpa/Malfus*n 037320.011 0.074-0.006 0115 ±0.036 0.049 r 0021 

ISLs, 0.104 ±0.007 0042-0.002 0.060 r 0.006 0.033x0 005 

HRPO (negative control) Q.Q4E r 0.013 0 03$ - p.p ts 0.038 -0.012 0.028 r 0006 

PWf ^l C ^: fnaft0Sa P"* 0 * f «™ ^aftose^ng 
VS££^!^ ****** 10 ho * era *' h Peroxidase (HRPO). were compared with that 
tr^J^L waS pBffQ 2? d * an EUSA 'o«™ as described in Experimental 

^^f' J?l?! UIS as 0D «» The data were analysed by the WBcoxon 

^ ^ S™" 1 ?? the Cpa-Mal fusion to collagen type I and to laminin wwj found to 
be statistically significant (P<0.05). 
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Table 4. Ust ol oligonucleotides used in this work. 



Designation 



Sequence (5' to b') 



Position numbers 



Reference 



nra FOR 
nraREV 
*ofA FOR 
fOfA REV 
cpa FOR 
cpa REV 
prtF FOR 
prtF REV 
priF2 FOR 
p*tF2 REV 
nifR3 FOR 
r'*ufR3 REV 

■ ;hra*in5 FOR 
« =nra-tns REV 
^lepa-ins FOR 

::epa-ins REV 
.;= itraLucFORl 

:JwaLuc FOR5 
^JnraLuc REV1 

Jcpa-pMAL FOR 
in cpa-pMAL REV 



ATTTTTTCTCATGTTGCTA 

GTTTAGAATGGTTTAATTG 

GCCAATAACTGAGGTAGC 

GGCTTTTGCTCTTTTAGGT 

AGTTCACAAGTTGTCTACTG 

AAATAATAGATAGCAAGCTG 

ATTAATGCCAGAGTTAGATG 

CGATTCTCTTCCACTTTG 

TACTCTGTTAAAGAAGTAACTG 

CTCAGAGTGACTTTCTGG 

GGATTTTGCCTACTACTTA 

6TGGAATATCTAAAACAGAC 

TTTTATTGGAGACTAGAAGTTTA 

AGCAAGCCACTGATTTAC 

TGCAAAAGAGGGATAAAAC 

GAAGCAGTAGACAACTTGTG 

TAAACTAAAGTAGCTTAGCA 

ATQGAACGTCATCACAAC 

CAGATACCT AAAAATAA ACG 

GCTGAAGAACAATCAGTACCA 

TTAGTCATTTTTTAACCCTTTACG 



6474-6492 
7308-7290 
141-158 
995-977 
34^5-3454 
3727-3708 
1414-1433 

2259- 2242 

2260- 2281 
3168-3151 
8443-8461 
9313-9294 

6325-6347 
7481-7464 
5932-5914 
4707-4726 
5953-5972 
8688-6705 
7930-7911 
S798-5778 
3705-3728 



This Study 
This study 
Fogg et at. (1994) 
Fogg era/ (1994) 
This study 
This study 

Hanski and Caparon (1992) 
Haneki and Caparon (1992) 
Jatte et&t. (1996) 
Jaffe et a/. (1996) 
This study 
This Study 

Tnis study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 



IRT-nra FOR 
:i=NT-nra REV 
^RT-orf5 FOR 

-RT-orfSREV 



CTT T TTACTTATT AAGAG ATGA 
CTCGTTTAGAAAATCTTG 
AA A ATAAT TA AATCA ATAGCA 
CCACAGAGATAATGTGT 



7669*7690 
7886-7869 
8030-8050 
8258-8241 



This study 
This study 
This study 
This study 



S^TJ^T ?f t0 dmp,ify probM ,or Soo,hdm 80d Nor,hem b,ot hybridizations (A), genomic fragments tor ctoninq 

2^!^I ' P 11 '^' pMAL " c2 plasmrtte < B > ** RT ~ P <* to ««ect nra- and orfS-specific transits (C) ™ ^ 

mSSS^ ^ F0R/REV; ftraLuc FOft/ « Ev ™* co*pMAL FOR/REV ware 5' extended with SphUSpe\ Nhelftemhl 

SS^' JESSf ? **** <* *» PCR ^«s The nucleotide posits numbers reS to the <msm 

genomic sequence as submitted to GenBenk or ma cited publications. " 
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EXAMPLES 

The following examples are provided which exemplify aspects of the preferred 
embodiments of the present invention. It should be appreciated by those of skill in the 
art that the techniques disclosed in the examples which follow represent techniques 
discovered by the inventors to function well in the practice of the invention, and thus 
can be considered to constitute preferred modes for its practice. However, those of 
skill in the art should, in light of the present disclosure, appreciate that many changes 
can be made in the specific embodiments which are disclosed and still obtain a like or 
similar result without departing from the spirit and scope of the invention. 

EXAMPLE 1: ISOLATION OF GROUP A STREPTOCOCCAL PROTEINS 

A Bacterial strains and culture conditions 

GAS serotype M49 strain CS101 was provided by P. Cleary, MN, USA. 
Serotypes M1, M2, M3, M4, M5, M6, M12, M18 and M24 GAS strains T1/195/2, 
T2/447RB4.119, B93076072, 75-194, T5B/126/3, S43/192/1, T12/126/4, J17C/55/1 and 
71-694 were obtained from D. Johnson, MN, USA. The M49 GAS isolates B737/137/1 , 
49-49/123, 88-299, 90-053, 90-397, 89-288, 90-306 and 8314/1945 have been 
described by Kaufhold ef a/. (1992). £ coli strain Blue MRF served as a host for phage 
Lambda ZAP Express. E.coli stain DH5a was used as host for plasmids pFW11 and 
pMAL-cZ 

E. coli DH5a isolates transformed with pFW11 or pMAL-c2 d rivativ s were 
grown on disk sensitivity testing agar (Unipath) suppl m nted with 100 mglM 
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specUnomyocin or 50 mgl* 1 ampicillin respectiv ly. E. coli Blue MRF strains infected 
with recombinant lambda phages were grown in NZ casamino acids/yeast extract (NZY) 
agar according to the instructions of the supplier (Stratagene). All E. coli cultures were 
grown in cultures were grown at 37°C in ambient air. 

GAS strains were cultured in TH broth and on TH agar (Unipath) both 
supplemented with 0.5% yeast extract (THY), or in chemically defined medium (COM) 
(van de Rijn and Kessler, 1980). The GAS mutant strains were maintained in medium 
containing 60 mgl" 1 spectinomycin. Culture conditions for GAS strains were a 
temperature of 37°C and a 5% CO2/20% 02 atmosphere unless specifically described. 

B. Vectors 

E. coli phage Lambda ZAP Express (BamH\ arms, CIP treated) was purchased 
from Stratagene and used according to the instructions of the manufacturer. 

Plasmid pFW11 was used for insertional mutagenesis as described by 
Podbielski et al. (1996c). Plasmid pFW11 multiple cloning site (MCS) 1. The 
luciferase (luc) box was amplified by PCR using plasmid pUSL2/5 (Grafe et al., 1996) 
as template and oligonucleotides lucFor 

(5'GACGATCTCGAGGAGGTAAATGAAGACGCCAAAAAC-3') and lucRev 
(S'GACGATAAGCTTTTACAATTTGGACTTTCCG-S' ) as primers. The luciferase box 
contained an optimized Shine-Oalgamo sequence as well as the luc start and stop 
codons. Cloning of GAS genomic fragm nts into MCS1 of pFW11-luc followed the 
protocol outlined by Podbi Iski et I. (1996c). 
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Plasmid pMAL-c2 was used for expression of the cpa g n and was purchased 
from New England Biolabs. It was used according to the instructions of the 
manufacturer. 

C. DNA techniques 

Chromosomal GAS DNA was prepared by the method of Martin et al. (1990). 
Plasmid DNA preparations and genetic manipulations as well as other conventional 
DNA techniques were performed as described by Ausabel et al. (1990). 
Transformation of GAS strains by electroporation was according to the protocol of 
Caparon and Scott (1991). 

Usage of the serotype M49 GAS Lambda library for sequencing of recombinant 
GAS genomic DNA followed the protocol of Podbielski et al. (1996b). Oligonucleotides 
used for sequencing and PCR were designed with the aid of OLIGO 5.0 (National 
Biosciences), synthesized on an OLIGO 1000 DNA synthesizer (Beckman) and 
desalted through NAP5 columns (Pharmacia). The parameters of PCR assays, direct 
labeling of PCR products with DIG-dUTP, analysis of PCR products and parameters for 
direct sequencing of PCR products were as described previously (Podbielski et al., 
1995). 

DNA sequences were compiled and analyzed with PC GENE 6.8 
(IntelliGenetics). Sequence comparisons were performed using the BLAST programs 
and the databases of the Gen Bank data library as well as the Streptococcal Genome 
Sequencing Project of the University of Oklahoma, USA (Roe etal., 1997). 
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D. RNA preparation and analysis 

For RNA preparations, serotype M49 GAS strains were grown aerobically to 
ODan values of 0.2, 0.5, and 0.9, which corresponded to early, medium and late 
logarithmic growth phases respectively. Before preparation, cells were sedimented by 
2 min centrifugation at 4°C, suspending in ice-cold 20mM Tris (pH 7.5)/5mM 
MgCl2/20mM sodium azide/400 mgl' 1 chloramphenicol. RNA preparation followed the 
protocol of Shaw and Clewed (1985). Denaturing agarose gel electrophoresis and 
Northern blot hybridizations with DIG-dUTP-labeled probes were performed as 
described previously (Pidbielski et a/., 1995). Probes were generated by asymmetric 
PCR, using only 10' 2 to 10" 3 of the normal amounts of the appropriate upstream 
primers. 

RT-PCR was performed with RNA after 30 min exposure to DNase I according 
to the manufacturer's protocol (Boehringer Mannheim). Reverse transcription using 
Superscript II RT-polymerase was done as described by the manufacturer (Gibco BRL) 
using the appropriate downstream primers (Table 4). One microlite of the RT assay 
was used as template for PCR employing the PCR primers listed in Table 4. Controls 
included primer control with genomic DNA template, reagent contamination control by 
running both reactions without RNA template, and DNA contamination control by 
running both reaction without RT-polymerase. 
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£. DNA mutagenesis experiments 

Insertional inactivatlon of the nm gene was performed using a recombinant 
pFW11 plasmid following the strategy and specific methods according to Podbielski et 
al. (1996c). The primers nra-insFOR/REV and cpa-insFOR/REV annealing to nra and 
cpa internal sequences were used to generate PCR products, which were cloned into 
pFW1 1 via the Sphl/Spel or NheUBamHl sites of MCS1 . Specific integration of the nra 
recombinant plasmid into the GAS genome was confirmed by Southern blot 
hybridization using BamHI-, Spef- and XbaAdigested genomic DNA and probes specific 
for the integrated antibiotic resistance marker aad9 as well as for the duplicated nra 
sequence. 

Construction of the nra pfomoteMutiferase fusions was performed using plasmid 
pFW1 1-Juc (this study) and PCR products comprising the 3' end of the nra gene or the 
entire nra promoter and structural gene region. For amplification of the PCR product, 
primers nraLucFOR5 or nraLucFORI, and nraLucREVI were used (Table 4). The 
primers annealed in the central region of the nra gene or immediately upstream of the 
cpa gene (Fig. 1) and at the stop codon of the nra gene. Using Nhel and BamHI sites 
as 5* tags for the upstream and downstream primers, respectively, the resulting PCR 
products were cloned into the corresponding MCS1 site of pFW11-luc. Specific 
integration of the plasmid in the GAS genome was confirmed as shown. 
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F. Measuring adherence to immobilized human matrix proteins 

Cells grown on solid medium were prepared by spreading a 10 ul aliquot of 
oversight cultures onto fresh THY agar plates and incubating the plates overnight in 
ambient air, 5% CO2 or anerobic incubators. Plates were then flooded with 3 ml of 
DPBS, pH 7.4 (PBS plus 0.88 mM CaCl2/0.45 mM MgCI 2 ) and incubated for 10 min at 
room temperature. Cells were suspended gently using a glass spreader, removed from 
the plate with a pipette avoiding the production of air bubbles and transferred into a test 
tube. Cells were then suspended by gentle, repeated pipetting. 

Labeling of bacteria and adhesion assays followed a protocol of Geelen et al., 
(1993). Specifically, for labeling of bacteria, thoroughly suspended cells were washed 
in 12 ml of DPBS and suspended in 2 ml of FITC solution (1 mg ml' 1 FITC in 50 mM 
sodium carbonate buffer, pH 9.2, stored in the dark and passed through a 0.2 urn pore 
size filter before use). After 20 min incubation at room temperature in the dark, cells 
were sedimented by centrifugation, washed in DPBS, suspended in 2 ml of DPBS and 
sonicated for 20 s at setting 4 in the refrigerated hollow horn of the sonifier 450 
(Branson Ultrasonic). The ODeoo values of the suspension were adjusted to 1.0 with 
DPBS, sonicated again to disruption of aggregates and kept in the dark until used. 

For immobilization of human matrix proteins, Terasaki microtitre plates were 
washed once with DPBS, pH 7.4. Then, 10 ml of 100 ng ml" 1 human fibronectin or 
collagen type 1 (Gibco BRL) was added to the wells and incubated ov might at room 
temperature in a moist chamber. 
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The preincubated Terasaki microtitre plates were washed with DPBS, and 
residual buffer was carefully removed. Then, 10 til aliquots of FITC-labeled cell 
suspensions were added to the wells and incubated for 60 min at 37°C in a 5% 
COa/20% O2 atmosphere. The plates were then washed five times with DPBS, and 
bound cells were fixed by flooding plates with 0.5% glutaraldehyde for 5 min. The 
plates were again washed twice with DPBS and kept in the dark until measured. The 
intensity of FITC labeling was controlled for each assay by measuring the fluorescence 
intensity of 10 nl aliquots of cells added in triplicate to uncoated DPBS-washed 
Terasaki microtitre plates and directly counted. 

Fluorescence of single wells was evaluated by processing the plates through an 
automated Cyto Fluor II fluorescence reader (PerSeptive Biosystems) operating with 
excitation and detection wavelengths of 485 nm and 530 nm respectively. Sensitivity 
gain levels of 72 or 62 were used for binding assays and FITC-labeling control 
respectively. 

For each assay, adherence to a human protein was measured for at least two 
coated plates and four replicate wells each located at different positions on the plates. 
For both matrix proteins, the assays were repeated at least four times on different days. 
To normalize the data, the following calculations were carried out. 

The four duplicates on a given plate were averaged to give a single value ("ave- 
RLU'). The ave-RLU values from the nra mutants on each plate were corrected for 
differences in FITC labeling intensity as follows: 

50 



ave-RLU x [(wild-type strain intensity of labeling)/(mutant strain int nsity of labeling)]. 

The maximum difference for intensity of labeling was less than a factor of 2. 
Standardization across experiments was accomplished by multiplying all values by a 
standardization factor. This standardization factor was derived by comparing all 
subsequent experiments to the first experiment using the following scheme: 

(wild-type strain intensity of labeling in assay no. 1)/(wild-type strain intensity of 
labeling in assay no. Y). 

Once calculated, all values derived in experiment Y were multiplied by the 
standardization factor. 

For comparison, unlabeled bacteria were tested for adherence to collagen type I 
and detected by a rabbit polyclonal anti-group A carbohydroxide antiserum as 
described by Gubbe (1 997 j. 

EXAMPLE 2: Expression of a recombinant CPA protein and determination of its 
matrix protein-binding properties 

The entire cpa gene except for its leader peptide encoding portion was amplified 
by PCR using the primers cpa-pMAL FOR and cpa-pMAL REV (Table 4). The resulting 
product was cloned into the BamHI and Pst1 sites of plasmid pMAL-c2. Expression in 
the presence of 2 mM IPTG with an induction period of 4h and subsequent non- 
denaturing preparation followed a protocol of Ausubel et al. (1990). Purification of th 
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recombinant CPA-maltose binding fusion protein using a composit amylose/agarose 
matrix performed according to the instructions of the manufacturer (New England 
Biolabs). The purified fusion protein was then labeled with peroxidase as described by 
Schmidt etal. (1993). 

Microtitre plates (96-well, flat-bottom; Nunc) were coated with BSA and human 
fibronectin, type I collagen or laminin (Gibco BRL) by adding 2 tig of each protein 
dissolved in 200 nl of 50 mM sodium carbonate, pH 8.6, to single wells. The wells were 
washed with PBS, pH 7.8, plus 0.5%. Tween 20 and blocked with 0.01% Tween 20 
(PBS-T). 

PeroxkJase-labeled Cpa-maltose binding protein fusion and recombinant purified 
maltose-binding protein (for control of specific binding of the bacteria) were added to 
the wells for 2 h at room temperature. Non-conjugated peroxidase at a 1 :300 dilution in 
PBS-T was used as a negative control. After washing with PBS-T, all wells were 
incubated with ortho-phenylenediamine (Sigma) and measured in an ELISA reader 
(SLT RainBou) set at 492 nm detection wavelength as outlined by Tijssen (1985). All 
assays were repeated on at least three independent occasions. 
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